Corpus: fra-ch_web_2018_1M

Other corpora

2.2.5 Most frequent word beginnings

The most frequent word beginnings as character N-grams for N=1...5 with Zipf's diagram


Zipf's diagram for word beginnings


Gnuplot diagram

Top Characters
word rank frequency n-gram
1 34114 d-
2 26795 l-
3 24178 S-
4 22260 C-
5 21782 c-
Top Character Bigrams
word rank frequency n-gram
1 10209 l’-
2 9659 co-
3 9653 d’-
4 9615 l'-
5 9030 d'-
Top Character Trigrams
word rank frequency n-gram
1 4131 con-
2 2556 pro-
3 2380 l’a-
4 2315 l'a-
5 2195 d’a-
Top Character 4-Grams
word rank frequency n-gram
1 1285 anti-
2 1150 cont-
3 1116 inte-
4 1055 http-
5 999 comp-
Top Character 5-Grams
word rank frequency n-gram
1 922 inter-
2 881 http:-
3 708 anti--
4 650 trans-
5 648 contr-
8785 msec needed at 2018-11-23 20:34